Phone boundary detection using sample-based acoustic parameters
نویسندگان
چکیده
A sample-based phone boundary detection algorithm is proposed in this paper. Some sample-based acoustic parameters are first extracted in the proposed method, including six sub-band signal envelopes, sample-based KL distance and spectral entropy. Then, the sample-based KL distance is used for boundary candidates preselection. Last, a supervised neural network is employed for final boundary detection. Experimental results using the TIMIT speech corpus showed that EERs of 13.2% and 15.1% were achieved for the training and test data sets, respectively. Moreover, 43.5% and 88.2% of boundaries detected were within 80and 240-sample error tolerance from manual labeling results at the EER operating point.
منابع مشابه
A Two-Stage Sample-Based Phone Boundary Detector Using Segmental Similarity Features
In this paper, a two-stage sample-based phone boundary detection algorithm is proposed. In the first stage, some local sample-based acoustic parameters are used to pre-select some phone boundary candidates. Then, in the second stage, some high-order statistics of the log-likelihood differences of two adjacent speech segments around each boundary candidate are calculated to serve as similarity m...
متن کامل高解析度之國語類音素單元端點自動標示 (Sample-based Phone-like Unit Automatic Labeling in Mandarin Speech) [In Chinese]
This paper presents a sample-based phone boundary detection algorithm which can improve the accuracy of phone boundary labeling in speech signal. In the conventional phone labeling method adopted the frame-based approach, some acoustic features, like MFCCs, are used. And, the statistical approaches are employed to find the phone boundary based on these frame-based features. The HMM-based forced...
متن کاملPersian Phone Recognition Using Acoustic Landmarks and Neural Network-based variability compensation methods
Speech recognition is a subfield of artificial intelligence that develops technologies to convert speech utterance into transcription. So far, various methods such as hidden Markov models and artificial neural networks have been used to develop speech recognition systems. In most of these systems, the speech signal frames are processed uniformly, while the information is not evenly distributed ...
متن کاملPrecise Phone Boundary Detection using Selective Context-dependent Acoustic Refinement
This paper proposes an automatic method for locating phone boundaries in speech utterances based on HMMbased forced alignment together with some context-dependent refinements. HMM-based forced alignment has been a preferred method for speech segmentation in many applications. However, the resulting boundaries are usually not consistent with real boundaries, defined based on abrupt changes in ac...
متن کاملPhone boundary detection using selective refinements and context-dependent acoustic features
Accurate placement of phone boundaries results in better performance of speech recognition systems as well as in the quality of concatenative speech synthesis. This study proposes a post-processing technique to refine the locations of phone boundaries provided by HMM-based forced alignment. The context-dependent Linear Discriminant Analysis (LDA) classifiers together with a confidence scoring s...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2010